Zeru Shi

MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory

May 14, 2026

Reinforcing Consistency in Video MLLMs with Structured Rewards

Apr 01, 2026

Counting Circuits: Mechanistic Interpretability of Visual Reasoning in Large Vision-Language Models

Mar 19, 2026

Improving Visual Reasoning with Iterative Evidence Refinement

Mar 14, 2026

Read the Scene, Not the Script: Outcome-Aware Safety for LLMs

Oct 05, 2025

Robustness-aware Automatic Prompt Optimization

Dec 24, 2024

SeFENet: Robust Deep Homography Estimation via Semantic-Driven Feature Enhancement

Dec 09, 2024

Dual Adversarial Resilience for Collaborating Robust Underwater Image Enhancement and Perception

Sep 03, 2023